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Abstract 

We present the partial branching fraction for inclusive charmless semileptonic B decays and the 
corresponding value of the CKM matrix element \V u b\, using a multivariate analysis method to 
access ~90% of the B — y X u £u phase space. This approach dramatically reduces the theoretical 
uncertainties from the b— quark mass and non-perturbative QCD compared to all previous inclusive 
measurements. The results are based on a sample of 657 million BB pairs collected with the Belle 
detector. We find that AB{B -> X u £u;p* e B > 1.0 GeV/c) = 1.963 x (l±0.088 8ta t. ±0.081 sys .) x 10" 3 . 
Corresponding values of \V u b\ are extracted using several theoretical calculations. 

PACS numbers: 12.15.Hh, 11.30.Er, 13.25.Hw 
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The comparison of the Cabibbo-Kobayashi-Maskawa (CKM) matrix element \V ub \ [jlll , 
which determines one of the sides of the Unitarity Triangle, and the CP angle <p 1 yO is one 
of the crucial tests of the Yukawa sector of the Standard Model. The angle 4>i is directly 
sensitive to potential CP-violation beyond the Standard Model and can be measured with 
high precision without theoretical input. In contrast, \V ub \ is insensitive to new physics 
as it is determined by tree-level weak decays but relies on input from theoretical QCD 
calculations. Currently measurements of X and \ V ub \ are not entirely consistent [fsh - 

Experimentally \V ub \ can be measured from semileptonic decays either using an exclu- 
sive hadronic final state, or by considering the inclusive rate, summing over all hadronic 
final states subject to some kinematical constraints. The two approaches involve different 
experimental and theoretical tools. Currently there is a ~ 2a discrepancy [0] in the respec- 
tive world averages, indicating an incomplete understanding of these tools. The extraction 
of \ V ub \ is a challenge for both theory and experiment. The primary difficulty in measuring 
\V ub \ with high precision in inclusive B — > X u iv decays is suppressing the background from 
B — > XJv, which is 50 times larger. All measurements to date have applied kinematic se- 
lection criteria to achieve separation from B — > X c iu decays. These required the use of 
the theoretical parameterizations called shape functions (SF) to describe the unmeasured 
regions of phase space. Here we report a measurement of the partial branching fraction 
of B — > X u £u decays with a lepton momentum threshold of 1 GeV/c using a multivariate 
data mining technique, and derive values of \V U b\ using several theoretical calculations 
• This approach accesses ~90% of the B — > X u £u phase space, the greatest reach of 
any inclusive \V ub \ measurement llsl- fioh . This is a major milestone in the measurement of 
\V ub \, as we can rely on the well tested theory used to describe B — > X c iv transitions, mini- 
mizing the dependence on a SF. Thus this is the single most precise determination of \y ub \, 
and provides a valuable new direction for \V ub \ determinations by addressing previously 
irreducible theoretical uncertainties. 

The measurement is made by fully reconstructing one B meson (-B tag ) decaying to a 
fully hadronic final state, and identifying the semileptonic decay of the other B meson 
(-B sig ) by the presence of a high momentum electron or muon. The data were collected 
with the Belle detector nilfl at the asymmetric-energy KEKB e + e collider [1 12f1 . The results 
presented in this Letter are based on a sample of 657 x 10 6 BB pairs collected at the T(4S) 
resonance (on-resonance). An additional 68 fb 1 data sample taken at 60 MeV below 
the T(45) resonance (off-resonance) is used to perform subtraction of background arising 
from the continuum e + e~ — > qq process (q = u, d, s, c). 

The £?ta g candidates are reconstructed following the procedure of Ref. lfl3h . in hadronic 
modes that determine their charge, flavour, and momentum. For each selected candidate, 
we calculate the beam-energy constrained mass, M bc = y/ (E^ cam ) 2 — (p* B ) 2 , and the en- 
ergy difference, AE = E* B — -E beam , where E^ cam , p* B and E* B are the beam energy, the 
reconstructed B momentum and the reconstructed B energy in the T(AS) rest frame, 
respectively. In events containing multiple B meson candidates, the candidate with the 
smallest yj, defined in Ref. [13]. The B tag purity of this sample is 25% (30%) for B + 
(B°) tags Q. Events with M bc e (5.27,5.2_9) GeV/c 2 , \AE\ < 0.05 GeV, and X 2 B < 10 
are considered for further analysis. True BB events for which the reconstruction of B t3ig 
is not correct are treated as background (referred to as combinatorial background) . This 
background peaks in the signal region of M bc . We derive the shape of the combinato- 
rial background from Monte Carlo (MC) as in Ref. I115n . with the yield normalized to the 
on— resonance data M bc sideband (M hc e (5.20, 5.25) GeV/c 2 ) after the subtraction of non- 
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BB (continuum) backgrounds. The continuum background is scaled by the integrated on- 
to off-resonance luminosity ratio, taking into account the cross-section difference. There 
are 1167329 ± 5412 stat . B candidates in the signal region (N tag ), after continuum and com- 
binatorial background subtraction. 

Electron and muon candidates decaying from B sig are required to originate from near 
the interaction vertex and pass through the barrel region of the detector, corresponding 
to an angular acceptance of #i ab G (35°, 125°) (6\ ah € (25°, 145°)) for electrons (muons), 
where 6> lab denotes the polar angle of the lepton candidate with respect to the direction 
opposite to the positron beam. We exclude tracks used in the reconstruction of the B tag 
and multiple reconstructed tracks generated by low-momentum particles spiraling in the 
drift chamber. We consider the lepton with the highest momentum in the B rest frame 
to be prompt. The lepton identification efficiencies and the probabilities to misidentify 
a pion, kaon or proton as a lepton have been measured as a function of the laboratory 
momentum and angles. The average electron (muon) identification efficiency and hadron 
misidentification rate are 97% (90%) and 0.7% (1.4%), respectively, over the full phase 
space. In B + tagged events, we require the lepton charge to be consistent with a prompt 
semileptonic decay of B sig . In B° events, we make no requirement on the lepton charge. 
For semileptonic B decays to electrons, we partially recover the efficiency loss due to 
bremsstrahlung as in Ref. [15]. The lepton momenta are calculated in the B meson rest 
frame (p} B ). Events with leptons from J/ifj decays, photon conversions, and tt° decays 
are rejected using the invariant mass of prompt lepton candidates in combination with an 
oppositely charged lepton; for electron candidates additional photons are included in the 
veto calculation. 

The B — > X u iv selection criteria are based on a non-linear multivariate analysis tech- 
nique, the Boosted Decision Tree (BDT) method dial , which takes into account various 
observables to form one event classification variable. A total of 17 discriminating vari- 
ables are used to form a BDT classifier, separating B — > X u iv decays from other kinds of 
B decays. These include quantities based on: the kinematics of the candidate semilep- 
tonic decay; discrete quantities such as the number of kaons; and quantities correlated 
to the quality of the event reconstruction, such as M bc . A description of the highest 
discriminating quantities follows. The absolute value of event net charge is found to 
be correlated to track multiplicity, which tends to be higher for b — > c transitions. The 
kinematic variables associated to the hadronic current, M x and P + (invariant mass, and 
energy-momentum of the hadronic system, X u , respectively) are calculated from the mea- 
sured momenta of all charged tracks and neutral clusters that are not associated to _B tag 
reconstruction or used as lepton candidates. The lepton current four-momentum is calcu- 
lated as q = pr(4s) — PB tag — Px- Missing momentum attributed solely to prompt neu- 
trinos should have a missing mass consistent with zero. Thus we calculate the miss- 
ing mass squared, m miss , of the events from the missing four-momentum P m i SS . The 
missing momentum is estimated from the four-momenta of the tag-side B and all re- 
constructed charged particles and photons that pass selection criteria on the signal side: 
Pmiss = Px ( 45) - Ps tag - Echargcd p ~ Encutrai p - To reduce contamination from B D*iv 
events, we search for low momentum pions from D* + — > D°tt + and calculate the momen- 
tum of the D* + and missing mass squared, rrv^- K , n ^ = (Pe sig — Pd* — Pi) 2 - The presence 
of kaons in semileptonic B meson decay is usually an indication of a b — > c transition, 
although b — >■ u decays with kaons from ss popping in the final state have been observed. 
Such decays are far less abundant than the charm cascade production of kaons, thus the 
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number of charged kaons and K$ mesons are considered in the multivariate analysis. We 
set an event selection threshold criterion for the BDT-classifier that is optimized with re- 
spect to both the systematic uncertainty from the background normalization fit and phase 
space dependent theoretical uncertainties. We set a lower threshold onp} B of 1.0 GeV/c. 

The backgrounds that remain after the BDT selection criteria are subtracted as de- 
scribed below. The continuum and combinatorial backgrounds follow the N B § determina- 
tion procedure described earlier in this Letter. All remaining backgrounds arise when the 
fully reconstructed B is correctly tagged, but the decay is either a charmed semileptonic B 
decay a secondary decay process that produced a high momentum lepton or is a misiden- 
tified hadron. The shapes of the charmed semileptonic B decay contribution, described 
in detail in Ref. I115n . and the secondary contribution, are determined from MC simula- 
tion. We estimate the overall normalization of these remaining backgrounds by fitting the 
observed inclusive spectra to the sum of the MC simulated signal and background contri- 
butions, after continuum and combinatorial background subtraction. There are three free 
parameters in the fit, corresponding to the yields of: B — > X u £u; B — > X c £u; and secon- 
daries and fakes. The fit is performed in two dimensional bins of M x versus q 2 for 4684±85 
input events, with a lepton momentum requirement of p* B > 1.0 GeV/c. The fit results in 
a good agreement between data and MC, with a \ 2 of 24 for 17 degrees of freedom (Fig- 
ure [T]). A total of 1032 ± 91 events remain after background subtration. We measure the 
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FIG. 1: Projections of the M x - q fit in bins of M x (left) and q 2 (right). 

partial branching fractions, combining the spectra from B + and B° semileptonic decays 
with the 1.0 GeV/c lepton momentum threshold. The expression for the partial branching 
fraction is AB = (A^J(2e^JV tag ))(l - 5 rad ), where N b % u and e^ u are the signal yield 
and signal efficiency for the region, A {p* B > 1.0 GeV/c), N tag is the number of tagged 
B events and 5 rad denotes QED corrections. The overall efficiency is 22.2%, determined 
from the fully reconstructed signal MC, reweighted at the generator level in bins of p e , P + , 
M x and q 2 following the prescription in this Letter. The QED correction is 1.4% of the 



branching fraction, obtained using Ref. I117n . The various contributions to the systematic 
error on the partial branching fraction are described below. 

To estimate the particle identification and reconstruction uncertainties, events with 
electrons and muons are reweighted and kaons, pions and photons are randomly removed 
according to their respective measured uncertainties. 
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The MC sample used for the signal B — > X u £u events is a hybrid mix of inclusive 
and exclusive contributions. Resonant semileptonic B decays to tt, and p and u modes 
are modeled with form factors calculated in Ref. lfl8h and Ref. lfl9h . respectively with 
branching fractions set to the world averages • Decays to 77 and 7/ have form factors 
derived from Ref. lf2~ol l and branching fractions set to the world averages [4]. The form 
factors and branching fractions of the unmeasured resonant components are predicted 
by Ref. I120n . The branching fractions of the resonant B — > X u £v final states have been 
varied by ±10% (vr), ±20% (p), ±30% (cu), ±50% (77) and ±100% (//) [H]. The relative 
contribution of the unmeasured components of the hybrid model MC are varied within 
the limits of the full inclusive branching fraction. The inclusive part of the mix uses a 
model based the SF parameterization in Ref. ll2~lh . The hybrid MC is corrected to match 
the moments of the q 2 and M x distributions as predicted by the model in Ref. [0] • The 
uncertainty in the inclusive component is determined by taking into account the error on 
the SF parameters and the theoretical and intrinsic uncertainties in the models in Refs. [Q, 



2 lfl . We estimate the uncertainty due to the simulation of kaon production in B — > X u l 



decays (i.e. gluon splitting into an ss pair), by varying the contribution of events with a 
kaon by 25%. 

Systematic errors in the subtraction of the non-BB background are dominated by the 
uncertainty in the relative normalization of the on- and off-resonance data, which is es- 
timated to be a 1% error on the continuum yield. The uncertainty due to mis-tagging is 
estimated by varying the lower bound on the M bc signal region, corresponding to a 10% 
variation in the ratio of good tags to incorrect tags in the signal region Olal . 

The systematic uncertainty due to the overall fit to data for the background contri- 
bution normalization is estimated by varying the number of bins used in the fit. The 
uncertainty due to secondary, cascade B — > D — > e decays is assessed by varying the 
branching fractions of semileptonic D decays, and B — > D anything by ±1<t Q22|] . The 
uncertainty associated to the magnitude of the hadron fake contribution is determined 
from measurements of Jig — > 7t + tt^ decays. 

To model backgrounds from B — > D£v and B — > D*tv decays we use parameterizations 
of the form factors based on heavy quark effective theory 02 3^2 ah The B — > Dtv and 
B — > D*£u decay slope parameters are set to the world averages D4D. The B — > D*iv de- 
cay parameters, Ri and R 2 are set to the most recently measured values [0] . The branching 
fractions of the D and D* components are based on Ref. Q22|] . For higher mass D** reso- 
nances we use the model in Ref. 026n with the method described in Ref. [15]. We adopt 
the prescription of Ref. 02711 for the non-resonant B — > D^iriv decay shapes. The normal- 
ization of the narrow resonant D** and non-resonant D*n components are based on values 
in Ref. |]il]. The remaining unmeasured contribution is matched to the full inclusive rate. 
To estimate the sensitivity to the rates of the exclusive B — > X c £u modes, we adjust their 
individual branching fractions about their measured uncertainties. To test the sensitivity 
to the shape of these contributions, we have varied the form factors for D*£u, and D£v 
decays about their measured uncertainties, and changed the model input parameters that 
describe the differential decay rates of the resonant D**iv decays. For the resonant D**£v 
decays, we take into account limits from measurements to resonant and non-resonant 
D^-nlv states, and full inclusive rates iRl fl3l l2~2h . The systematic uncertainty on the non- 
resonant D^txIv decay modes is estimated as half of the shift between the bounds on 
the branching fractions. The simulation of QED corrections incurs a negligible systematic 
error 015[1 . 
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We estimate the effect of the specific choice of parameters used in training the BDT by 
varying the pruning technique, the number of trees, and the minimum number of events 
in each node by 20% for each respective quantity. 

The partial branching fraction for p* e B > 1.0 GeV/c is AB(B — > X u £is;p} B > 
1.0 GeV/c) = 1.963 x (1 ± 0.088 stat . ± 0.081 sys .) x 10~ 3 . A breakdown of the uncertain- 
ties is provided in Table IB We obtain \V ub \ directly from the partial branching fraction 

TABLE I: Uncertainties in the partial charmless semileptonic branching fraction (in percent). 
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3.1 
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8.1 
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using |Kb| 2 = ^Butv/ (j~ B /S.1Z), where A7£ is the predicted B — > X u tv partial rate in the 
given phase space region, and r B is the average B lifetime [0]. Table [H] lists |14&| values 
extracted with the most recent QCD calculations [Hl-@], where the errors are statistical, 
systematic, from the error on m b , and theoretical, respectively. Within their stated theo- 
retical uncertainties, the results in Table [II] are consistent. 



TABLE II: Values for \V u b\ with relative errors (in %). 
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BLNP [ 
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GGOU [0] 


4.41 
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In summary, a new experimental technique has been employed to measure the branch- 
ing fraction of inclusive charmless semileptonic B decays (B — > X u iu) over nearly the 
full kinematical phase space, resulting in a large reduction on the uncertainty of \V ub \. 
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We provide a more reliable, comprehensive treatment of the many contributions to the 
signal and background processes, while reducing the experimental (non-model) system- 
atic errors on \V U b\ by ~ 30%, with respect to Ref. [[8Q. The theoretical uncertainties on 
| V ub \, dominated by the uncertainties on the 6-quark mass and the shape function, are 40% 
lower in the schemes in Ref. [6] and [Q] and 20% lower in the scheme in Ref. [5] than 
in our previous measurement [8]. The SF errors have been almost completely removed 
from the theoretical extrapolation. The improvement in the uncertainty is primarily due 
to the increase in the measured phase space, which decreases the power dependence of 
\V ub \ on the 6-quark mass. These values have an overall uncertainty of ~ 7%, competitive 
with that of the world average and in agreement at the ~ la level. This result in- 
creases our confidence in the inclusive determination of \ V U b\, further highlighting the gap 
between the inclusive and the exclusive determinations, and with sin20x. This is the last 
measurement of inclusive \ V ub \ by Belle, using the full T(AS) data sample. 
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